Performance Improvement in Estimating Subjective Agedness with Prosodic Features

نویسندگان

  • Nobuaki Minematsu
  • Mariko Sekiguchi
  • Keikichi Hirose
چکیده

In this paper, we propose a technique which automatically estimates speakers’ agedness only with acoustic, not linguistic, information of their utterances. This method is realized by integrating GMM(Gaussian Mixture Model)-based speaker recognition techniques with modules for calculating prosody-based agedness scores. We firstly divided speakers of two databases, JNAS and S(senior)-JNAS, into two groups by listening tests. One group has only the speakers whose speech sounds so aged that one should take special care when he/she talks to them. The other group has the remaining speakers of the two databases. After that, each speaker group was modeled with GMM. Experiments of automatic identification of the speaker group showed the correct identification rate of 91%. To improve the performance, two prosodic features were considered, i.e, speech rate and local perturbation of power. Using these features, the identification rate was raised up to 95%. Finally, using scores calculated by integrating the GMM and the prosodic modules, experiments were carried out to automatically estimate speakers’ agedness. The results showed high correlation between speakers’ agedness estimated subjectively by humans and the automatically calculated scores with the proposed method.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

E ectiveness of Prosodic Information in Dependency Analysis of Japanese Sentences

This paper is concerned with measuring the amount of syntactic information contained in prosodic features of read Japanese sentences. Five prosodic features were chosen, and statistical relation between those features and interphrase dependency distance was estimated from a speech database. Then a number of experiments on dependency analysis of Japanese sentences were conducted with the minimum...

متن کامل

The effect of bilateral subthalamic nucleus deep brain stimulation (STN-DBS) on the acoustic and prosodic features in patients with Parkinson’s disease: A study protocol for the first trial on Iranian patients

Background: The effect of subthalamic nucleus deep brain stimulation (STN-DBS) on the voice features in Parkinson’s disease (PD) is controversial. No study has evaluated the voice features of PD underwent STN-DBS by the acoustic, perceptual, and patient-based assessments comprehensively. Furthermore, there is no study to investigate prosodic features before and after DBS in PD. The curren...

متن کامل

Using system and user performance features to improve emotion detection in spoken tutoring dialogs

In this study, we incorporate automatically obtained system/user performance features into machine learning experiments to detect student emotion in computer tutoring dialogs. Our results show a relative improvement of 2.7% on classification accuracy and 8.08% on Kappa over using standard lexical, prosodic, sequential, and identification features. This level of improvement is comparable to the ...

متن کامل

Comparing prosodic models for speaker recognition

Recently, speaker verification systems using different kinds of prosodic features have been proposed. Although it has been shown that most of these speaker verification systems can improve system performance using score-level fusion with stateof-the-art cepstral-based systems, a systematic comparison of the prosodic modelling algorithms used in these prosodic systems has not yet been performed....

متن کامل

Estimating the Sincerity of Apologies in Speech by DNN Rank Learning and Prosodic Analysis

In the Sincerity Sub-Challenge of the Interspeech ComParE 2016 Challenge, the task is to estimate user-annotated sincerity scores for speech samples. We interpret this challenge as a ranklearning regression task, since the evaluation metric (Spearman’s correlation) is calculated from the rank of the instances. As a first approach, Deep Neural Networks are used by introducing a novel error crite...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002